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Abstract. The c-pumpkin is the graph with two vertices linked by c > 1 parallel 
edges. A c-pumpkin-model in a graph G is a pair {A, B} of disjoint subsets of vertices 
of G, each inducing a connected subgraph of G, such that there are at least c edges 
' in G between A and B. We focus on covering and packing c-pumpkin-models in 

, a given graph: On the one hand, we provide an FPT algorithm running in time 

2'^''°'n'^'^' deciding, for any fixed c > 1, whether all c-pumpkin-models can be covered 
by at most k vertices. This generalizes known single-exponential FPT algorithms for 
Vertex Cover and Feedback Vertex Set, which correspond to the cases c = 1, 2 
respectively. On the other hand, we present a (^(log n)-approximation algorithm for 
both the problems of covering all c-pumpkin-models with a smallest number of vertices, 
and packing a maximum number of vertex-disjoint c-pumpkin-models. 



Keywords: Covering and packing; parameterized complexity; approximation al- 
' gorithm; single-exponential algorithm; iterative compression; graph minors. 
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1. Introduction 



The c-pumpkin is the graph with two vertices hnked by c > 1 parallel edges. A 
c-pumpkin-model in a graph G is a pair {^4, B} of disjoint subsets of vertices of G, 
I each inducing a connected subgraph of G, such that there are at least c edges in G 

•r:^ • between A and B. In this article we study the problems of covering all c-pumpkin- 

I models of a given graph G with few vertices, and packing as many disjoint c-pumpkin- 

models in G as possible. As discussed below, these problems generalize several well- 
studied problems in algorithmic graph theory. We focus on FPT algorithms for the 
parameterized version of the covering problem, as well as on poly-time approximation 
algorithms for the optimization version of both the packing and covering problems. 



FPT algorithms. From the parameterized complexity perspective, we study the fol- 
lowing problem for every fixed integer c > 1. 



p-c-PuMPKiN-CovERiNG (p-c-CovER for short) 

Instance: A graph G and a non-negative integer k. 
Parameter: k. 
Question: Does there exist S C V{G), \S\ < k, such that 

G \ S does not contain the c-pumpkin as a minor? 



When c = 1, the p-c-CovER problem is the p- Vertex Cover problem [2,9]. For 
c = 2, it is the ^j-Feedback Vertex Set problem [7, 11, 20]. When c = 3, this 
corresponds to the recently introduced p-DiAMOND Hitting Set problem [15]. 

The p-c-CovER problem can also be seen as a particular case of the following problem, 
recently introduced by Fomin et al. [18] and studied from the kernelization perspective: 
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Let be a finite set of graphs. In tlie p-J^-CovER problem, we are given an n-vertex 
graph G and an integer k as input, and asked whether at most k vertices can be deleted 
from G such that the resulting graph does not contain any graph from as a minor. 
Among other results, it is proved in [18] that if J-" contains a c-pumpkin for some c > 1, 
then p-J^-CovER admits a kernel of size 0{k'^ log^^^ k). As discussed in Section [3l this 
kernel leads to a simple FPT algorithm for p-J^-CovER in this case, and in particular 
for p-c-CovER, with running time 2^'^^^°^^'^ ■ n^'^^\ A natural question is whether there 
exists an algorithm for p-c-CovER with running time 2^^^^ ■ n'-^^^^ for every fixed c > 1. 
Such algorithms are called single- exponential. For the Vertex Cover problem the 
existence of single-exponential algorithms is well-known since almost the beginnings of 
the field of Parameterized Complexity [2], the best current algorithm being by Chen 
et al. [9]. On the other hand, the question about the existence of single-exponential 
algorithms for p-FEEDBACK VERTEX Set was open for a long time and was finally 
settled independently by Quo et al [20] (using iterative compression) and by Dehne et 
al. [11]. The current champion algorithm for p-FEEDBACK Vertex Set runs in time 
0(3.83'^fc-n2) [7]. 

We present in Section [3] a single-exponential algorithm for p-c-CovER for every fixed 
c > 1, using a combination of a kernelization-like technique and iterative compression. 
Notice that this generahzes the above results for p- Vertex Cover and p-Feedback 
Vertex Set. We remark that asymptotically these algorithms are optimal, that is, it 
is known that unless ETH fails neither p- Vertex Cover nor p-Feedback Vertex 
Set admit an algorithm with running time 2°^'^^ • n'^'-^-' [8,21]. It is worth to mention 
here that a similar quantitative approach was taken by Lampis [23] for graph problems 
expressible in MSOL parameterized by the sum of the formula size and the size of a 
minimum vertex cover of the input graph. 



Approximation algorithms. For a fixed integer c > 1, we define the following two 
optimization problems. 



Minimum c-Pumpkin-Covering (Min c-Cover for short) 
Input: A graph G. 
Output: A subset 5 C T/(G) such that G\S 

does not contain the c-pumpkin as a minor. 
Objective: Minimize IS*]. 



Maximum c-Pumpkin-Packing (Max c-Packing for short) 
Input: A graph G. 
Output: A collection M of vertex-disjoint subgraphs of G, 
each containing the c-pumpkin as a minor. 
Objective: Maximize 



Let us now discuss how the above problems encompass several well-known problems. 
For c = 1, Min 1-Cover is the Minimum Vertex Cover problem, which can be 
easily 2-approximated by finding any maximal matching, whereas Max 1-Packing cor- 
responds to finding a Maximum Matching, which can be done in polynomial time. 
For c = 2, Min 2-Cover is the Minimum Feedback Vertex Set problem, which 
can be also 2-approximated [1,3], whereas Max 2- Packing corresponds to Maximum 
Vertex-Disjoint Cycle Packing, which can be approximated to within a 0{logn) 
factor [22]. For c = 3, MiN 3-CovER is the Diamond Hitting Set problem studied 
by Fiorini et al. in [15], where a 9-approximation algorithm is given. 
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We provide in Section U] an algorithm that approximates both the MiN c-Cover 
and the Max c- Packing problems to within a 0{logn) factor for every fixed c > 1. 
Note that this algorithm matches the best existing algorithms for Max c-Packing for 
c = 2 [22]. For the MiN c-Cover problem, our result is only a slight improvement on the 
©(log^/^ n)-approximation algorithm given in [18]. However, for the MAX c- Packing 
problem, there was no approximation algorithm known before except for the c < 2 case. 
Also, let us remark that, for c > 2 and every fixed e > 0, Max c-Packing is quasi- 
NP-hard to approximate to within a C'(log^''^~'' n) factor: For c = 2 this was shown by 
Friggstad and Salavatipour [19], and their result can be extended to the case c > 2 in 
the following straightforward way: given an instance G of Max 2-Packing, we build 
an instance of Max c-Packing by replacing each edge of G with c — 1 parallel edges. 

The main ingredient of our approximation algorithm is the following combinatorial 
result: We show that every n- vertex graph G either contains a small c-pumpkin- model 
or has a structure that can be reduced in polynomial time, giving a smaller equivalent 
instance for both the MiN c-CovER and the MAX c-Packing problems. Here by a 
"small" c-pumpkin-model, we mean a model of size at most /(c) -logn for some function 
/ independent of n. This result extends one of Fiorini et al. [15], who dealt with the 
case c = 3. 

2. Preliminaries 

Graphs. We use standard graph terminology, see for instance [12]. All graphs in this 
article are finite and undirected, and may have parallel edges but no loops. We will 
sometimes restrict our attention to simple graphs, that is, graphs without parallel edges. 

Given a graph G, we denote the vertex set of G by V{G) and the edge set of G 
by E{G). We use the shorthand |G| for the number of vertices in G. For a subset 
X C V{G), we use G[X] to denote the subgraph of G induced by X. For a subset 
Y C E{G) we let G[Y] be the graph with E{G[Y]) := Y and with V{G[Y]) being the 
set of vertices of G incident with some edge in Y. For a subset X C V[G), we may use 
the notation G\X to denote the graph G[V{G) \ X]. 

The set of neighbors of a vertex of a graph G is denoted by Ng{v). The degree 
deg(j(f) of a vertex v G V{G) is defined as the number of edges incident with v (thus 
parallel edges are counted). We write deg^(7;) for the number of neighbors of v, that is, 
degQ{v) := \Ng{v)\. Similarly, given a subgraph H G with v G V{H), we can define in 
the natural way Nh{v), degjj{v), and deg*f^{v), that is, Nh{v) = NG{v)r\V{H), degfj{v) 
is the number of edges incident with v with both endpoints in H, and deg*f^{v) = \Nh{v)\. 
In these notations, we may drop the subscript if the graph is clear from the context. The 
minimum degree of a vertex in a graph G is denoted S{G), and the maximum degree 
of a vertex in G is denoted A(G). We use the notation cc(G) to denote the number of 
connected components of G. Also, we let n{G) denote the maximum multiplicity of an 
edge in G. 

Minors and models. Given a graph G and an edge e E E{G), let G\e be the graph 
obtained from G by removing the edge e, and let G/e be the graph obtained from G 
by contracting e (we note that parallel edges resulting from the contraction are kept 
but loops are deleted). If H can be obtained from a subgraph of G by a (possibly 
empty) sequence of edge contractions, we say that H is a minor of G, and we denote 
it by H :<m G. A graph G is H -minor-free, or simply H-free, if G does not contain 
H as a minor. A model of a graph H, or simply H-model, in a graph G is a collection 
{Sv C V{G) I V £ V{H)} such that 

(i) G[Si,] is connected for every v £ V{H)] 



4 



G. JORET, C. PAUL, I. SAU, S. SAURABH, AND S. THOMASSE 



(ii) Sy and Sw are disjoint for every two distinct vertices v, w of H; and 

(iii) there are at least as many edges between Sy and in G as between v and w 
in H, for every vw £ E[H). 

The size of the model is defined as Ylvi^viH) \Sv\- Clearly, H \s a, minor of G if and only 
if there exists a model of H in G. In this paper, we will almost exclusively consider 
-ff- models with H being isomorphic to a c-pumpkin for some c > 1. Thus a c-pumpkin- 
model in a graph G is specified by an unordered pair {A, of disjoint subsets of vertices 
of G, each inducing a connected subgraph of G, such that there are at least c edges in 
G between A and B. A c-pumpkin-model {A,B} of G is said to be minimal if there is 
no c-pumpkin-model {A', B'} of G with yl' C ^, C 5, and + \B'\ < \A\ + \B\. 

A subset X of vertices of a graph G such that G\X has no c-pumpkin- minor is called 
a c-pumpkin-hitting set, or simply c-hitting set. We denote by Tc{G) the minimum size of 
a c-pumpkin-hitting set in G. A collection of vertex-disjoint subgraphs of a graph G, 
each containing a c-pumpkin-model, is called a c-pumpkin-packing, or simply c-packing. 
We denote by i^dG) the maximum size of a c-pumpkin-packing in G. Obviously, for any 
graph G it holds that fc(G) < Tc(G), but the converse is not necessarily true. 

The following lemma on models will be useful in our algorithms. The proof is straight- 
forward and hence is omitted. 

Lemma 2.1. Suppose G' is obtained from a graph G by contracting some vertex- disjoint 
subgraphs of G, each of diameter at most k. Then, given an H-model in G' of size s, 
one can compute in polynomial time an H-model in G of size at most k ■ A{H) ■ s. 

Parameterized algorithms. See for instance [13] for an introduction to Parameterized 
Complexity. A parameterized problem IT is a subset of F* x N for some finite alphabet F. 
An instance of a parameterized problem consists of a pair {x,k), where k is called the 
parameter. A central notion in parameterized complexity is fixed parameter tractability 
(FPT), which means, for a given instance {x,k), solvability in time f{k) -p^xl), where 
/ is an arbitrary function of k and p is a polynomial in the input size. 

A kernelization algorithm or, in short, a kernel for a parameterized problem IT C 
F* X N is an algorithm that given (x, A:) G F* x N outputs in time polynomial in \x\ + k 
a pair {x',k') e F* x N such that 

(i) (x, fc) G n if and only if {x' , k') £ H; and 

(ii) maxjlx'l, A;'} < g{k), 

where g is some computable function. The function g is referred to as the size of the 
kernel. If g{k) = k^^^^ or g{k) = 0{k), then we say that II admits a polynomial kernel 
and a linear kernel, respectively. 

Iterative compression is a tool that has been used successfully in finding fast FPT 
algorithms for a number of parameterized problems. The main idea behind iterative 
compression is an algorithm which, given a solution of size + 1 for a problem, either 
compresses it to a solution of size k or proves that there is no solution of size k. This 
technique was first introduced by Reed et al. to solve the Odd Cycle Transversal 
problem [25] , where one is interested in finding a set of at most k vertices whose deletion 
makes the graph bipartite [25]. Since then, it has been extensively used in the literature, 
see for instance [10,17,20]. 

Tree-width. We briefly recall the definition of the tree-width of a graph. A tree de- 
composition of a graph G is an ordered pair (T, {Wx \ x G V{T)}) where T is a tree and 
{Wx I X £ V{T)} a family of subsets of V{G) (called bags) such that 

(i) UxeV(T)W. = V{G); 

(ii) for every edge uv € E{G), there exists x G V{T) with u,v £ Wx', and 



HITTING AND HARVESTING PUMPKINS 



5 



(iii) for every vertex u G V{G), the set {x G V{T) \ u G Wx} induces a subtree of T. 
The width of tree decomposition {T,{Wx \ x £ V(T)}) is max{ | T^/'a; | -1 \ x £ V{T)}. 
The tree-width tw(G) of G is the minimum width among all tree decompositions of G. 
We refer the reader to Diestel's book [12] for an introduction to the theory of tree-width. 
It is an easy exercise to check that the tree-width of a simple graph is an upper bound 
on its minimum degree. This implies the following lemma. 

Lemma 2.2. Every n-vertex simple graph with tree-width k has at most k ■ n edges. 

We will need the following result of Bodlaender et al [6] . 

Theorem 2.3 (Bodlaender et al. [6]). Every graph not containing a c-pumpkin as a 
minor has tree-width at most 2c — 1. 

The following corollary is an immediate consequence of the above theorem. 

Corollary 2.4. Every n-vertex graph with no minor isomorphic to a c-pumpkin has at 
most (c — 1) • (2c — 1) • n edges. 

3. A SINGLE-EXPONENTIAL FPT ALGORITHM 

As mentioned in the introduction, it is proved in [18] that given an instance (G, k) 
of p-J^-CovER such that contains a c-pumpkin for some c > 1, one can obtain in 
polynomial time an equivalent instance of size 0{k^ log^^"^ k). This kernel leads to the 
following simple FPT algorithm for p-J^-CovER: First compute the kernel K in poly- 
nomial time, and then for every subset S C V{K) of size k, test whether K[V{K) \ S] 
contains any of the graphs in as a minor, using the poly-time algorithm of Robertson 
and Seymour [26] for every graph in J^. If for some S we have that K[V{K)\S] contains 
no graph from as a minor, we answer Yes; otherwise the answer is No. The running 
time of this algorithm is clearly bounded by . n^W = 2C?(fci°s^-) . n^W. 

In this section we give an algorithm for p-c-PuMPKiN-CovERiNG that runs in time 
any fixed c > 1, where d only depends on the fixed constant c. Towards this, 
we first introduce a variant of p-c-PuMPKiN-CovERiNG, namely p-c-PuMPKiN-DiSJOiNT 
Covering, or p-c-Disjoint Cover for short. In p-c-DisJOiNT Cover, apart from a 
graph G and a positive integer k, we are also given a set S of size at most A; + 1 such 
that G\S does not contain the c-pumpkin as a minor. Here the objective is to find a set 
S' C V{G) \ S such that \S'\ < k and G\S' does not contain the c-pumpkin as a minor. 
Next we show a lemma that allows us to relate the two problems mentioned above. 

Lemma 3.1. //p-c-DiSJOiNT Cover can be solved in time ri{cY ■n^^^\ then p-c-CovER 
can be solved in time (?7(c) + 1)*^ • n'-^^^\ 

Proof. We order the vertices of V{G) as vi,V2, . . . , Vn and define Vi = {vi, . . . , vi} for 
every 1 < i < n. Notice that if G has a c-hitting set S of size k then S r\Vi is a 
c-hitting set of G[Vi] for every i < n. Furthermore, if S is c-hitting set of G[Vi] then 
5U {vi^i} is a c-hitting set of G[Vi+i]. Finally, Vk is a c-hitting set of size k of G[Vfc]. 
These three facts together with the r]{c)^ ■n'-^^^^ algorithm for p-c-DiSJOiNT Cover give 
a {r]{c) + 1)'^ • Ji'^^^) time algorithm for p-c-CovER as follows. 

Given (G[Vfc+i], Vk+i, k) we guess the intersection of our prospective S' with Vk+i and 
recurse. That is, for every subset Y C Vk+i such that \Y\ < k we want to check whether 
there exists a set Z C V^+i \ Y such that y U Z is a c-hitting set of size at most k for 
G[Vi+i]. We first check whether G[Vfe_|_i \ Y] is c-pumpkin- free; if it is then we get an 
instance {G[Vk+i \ Y],Vk+i \ Y,k — \Y\) of p-c-DiSJOiNT Cover. On this instance we 
call an algorithm for p-c-DiSJOiNT Cover. If for every Y the algorithm returns that 
(G[Vfc4-i \Y],N,k — \Y\) has no c-hitting set of size at most k, then we return that G 
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has no fe-sized c-hitting set. Else there exists a set Y for which the algorithm returns a 
c-hitting set Z of size at most k — \ Y\, call Z yjY as S^+i- Observe that the running 



time of this step is bounded by C'|^)(??(c) + 1)''"' • n^^^) = (r/(c) + 1)'= • n^(i). 



We repeat the steps described in the second paragraph, now with input 
{G\Vk+2]i Sk+i U {vk+2}-,k). Again, we either receive a "no" answer or a /c-sized c- 
hitting set Sk+2 of G[Vfc+2] and again, if the answer is negative then G has no /c-sized 
c-hitting set. Otherwise we call the algorithm described in the second paragraph with 
input (G, Sk+2 U {ffc+s}, k) and keep going on in a similar manner. If we receive a neg- 
ative answer at some step we answer that G has no fc-sized c-hitting set. If we do not 
receive a negative answer at any step, then after n — k rounds we have a /c-sized c-hitting 
set of = G. Thus we have resolved the input instance in time (??(c) + -nP^^^ . □ 

Lemma 13.11 allows us to focus on p-c-DiSJOiNT Cover. In what follows we give an 
algorithm for p-c-DiSJOiNT Cover that runs in single-exponential time. 

Overview of the algorithm. The algorithm for p-c-DiSJOiNT Cover is based on 
a combination of branching and poly-time preprocessing. Let (G, S, k) be the given 
instance of p-c-DiSJOiNT Cover and let V := V{G). Our main objective is to eventually 
show that A := {v ^ V \ S : Nc{v) r\ S ^ has cardinality 0{k). As far as we 
cannot guarantee this upper bound, we will see that we can branch suitably to obtain 
a few subproblems. Once we have the desired upper bound, we use a protrusion-based 
reduction rule from [18] to give a poly-time procedure that given an instance (G, 5", k) of 
p-c-DiSJOiNT Cover, returns an equivalent instance {G',S,k') such that G' has 0{k) 
vertices. That is, we obtain a linear vertex kernel for p-c-DlSJOiNT Cover in this 
particular case. Notice that once we have a linear vertex kernel of size ak for p-c- 
DiSJOiNT Cover, we can solve the problem in (^^) ■ k^^^\ So the overah algorithm 
consists of a recursion tree where in leaf nodes we obtain a linear kernel and then solve 
the problem using brute force enumeration. 

We can now proceed to the formal description of the algorithm. Let 



Linear kernel. We start of with a procedure, called protrusion rule, that bounds the 
size of our graph when |y4 = Vi U V>2| = 0{k). Given R C V{G), we define dc{R) as 
the set of vertices in R that have a neighbor in V{G) \ R. Thus the neighborhood of 
R is NciR) = dciViG) \ R). We say that a set X C V{G) is an r-protrusion of G if 
tw(G[A']) < r and |9g(A')| < r. We now state the protrusion rule. 

P Let {G, S, k) be an instance and let 7 : N — )• N be the function defined in [18, 
Lemma 5] (originally shown in [4]). If G contains a 4c-protrusion Y of size at 
least 7(4c), then replace Y to obtain an equivalent instance (G*, S, k*) such that 



The proof of the next lemma is identical to the proof of [18, Theorem 1], we provide 
it here for the sake of completeness. 

Lemma 3.2. // \ A = Vi^J V>2\ = 0{k), then p-c-DiSJOiNT Cover has a kernel with 
0{\A\) = 0{k) vertices. 

Proof. Let {G,S,k) be an instance of p-c-DiSJOiNT Cover such that |^| is bounded 
by 0{k). We apply [18, Lemma 4] and obtain in linear time a 2((2c — 1) -|- 1) = 4c- 
protrusion y of G of size at least ^^^|^^|^~J'^^ in G \ 5. Let 7 : N — )■ N be the function 

defined in [18, Lemma 5] (originally shown in [4]). If > 7(4c), then using [18, 



Vi 

V>2 



{veV\S:\NG{v)nS\=l} 
{veV\S: \NG{v)nS\ > 2}. 



\V{G*)\ < \V{G)\. 
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Lemma 5] we replace the 4c-protrusion y in G and obtain an instance {G*,S,k*) such 
that |1^(G'*)| < |V^(G)|, k* < k, and {G*,S,k*) is a YES-instance of p-c-DiSJOiNT Cover 
if and only if (G, S, k) is a YES-instance of p-c-DiSJOiNT Cover. In the whole process 
we never delete either the vertices of S or yl. 

Let {G*,S,k*) be the reduced instance. In other words, there is no 4c-protrusion of 
size 7(4c) in G* \ S, and protrusion rule P no longer applies. We claim that the number 
of vertices in this graph is bounded by 0{k). Indeed, since we cannot apply protrusion 
rule P, we have that ^ t('^c)- Because k* < k, we have that \V{G*)\ < 

7(4c)(4|^| + 1) + \S\. Since \A\ = 0{k) and \S\ < k we have that \ViG*)\ = 0{k). This 
completes the proof. □ 

Branching procedure. For our branching algorithm we take the following measure: 
(1) n = cciG[S]) + k. 

For simplicity we call the vertices in Vi white. We start with some simple reduction 
rules (depending on c) that will be applied in the compression routine. We also prove, 
together with the description of each rule, that they are valid for our problem: 

Rl Let C be a connected component of G[y \ 5]. If C does not have any neighbor in 
S, we can safely delete G, as its vertices will never participate in a c-pumpkin- 
model. 

R2 Let G be a connected component of G[V \{SL) {v})] for some vertex v £ V \ S. 

If G does not have any neighbor in S, we can safely delete G, as its vertices will 

never participate in a minimal c-pumpkin-model. 
R3 Let G be a connected component of G[V \ S]. If G has exactly one neighbor v 

in S and G[V{C) U {v}] is c-pumpkin-free, we can safely delete it, as its vertices 

will never participate in a minimal c-pumpkin-model. 
R4 Let Bhe a connected induced subgraph (not necessarily a connected component) 

of G[V \ S], let V £ V{B), and let Pi, . . . , Pi be the connected components of 

G[V{B) \ {v}]. Assume that i > c and that the following conditions hold: 

(i) For 1 < i < ^, no vertex in Pi has a neighbor in y \ (S U V{Pi) U {v}). 

(ii) For I < i < i, there exists a vertex Ui £ S such that ^weV{Pi)^Giw) Ci S = 

{Ui}. 

(iii) For 1 < i < i, G[V{Pi) U {ui}] is c-pumpkin-free. 

(iv) The vertices ui, . . . , belong to the same connected component D of G[S]. 
Then we include vertex v in the solution and we decrease the parameter by 

one. Indeed, note that as ^ > c and by conditions (ii) and (iv), G[V{B) U V{D)] 
contains a c-pumpkin, so any solution needs to contain at least one vertex of B, 
since we are looking for a solution that does not include vertices from S. On the 
other hand, by condition (iii) every minimal c-pumpkin-model intersecting B 
necessarily contains vertex v, and by condition (i) the deletion of v guarantees 
that no minimal c-pumpkin-model of G \ {v} contains a vertex of B. There- 
fore, we can safely include vertex v in the solution and decrease the parameter 
accordingly. See Fig. [1] for an illustration for c = 4. 

We say that the instance (G, S, k) is (5, c)-reduced if rules Rl, R2, R3, or R4 cannot 
be applied anymore. Note that these reduction rules can easily be applied in polynomial 
time. 

We now describe a branching rule that will be used in our compression routine: 

B let P be a simple path in G[V \ S] with |F(-P)| = £ and let vi and V2 be the 
endpoints of P. Suppose that vi (resp. V2) has a neighbor in a connected 
component Gi (resp. G2) of G[S], with Gi ^ G2. Then we branch for all 2^ 
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Figure 1 . Illustration of reduction rule R4 for c = 4. 

subsets of vertices in P. Note that in every case we decrease the measure of the 
iterative compression, as either we include at least one vertex in the solution or 
we decrease the number of connected components of G[S]. 
The branching rule above will be used on paths P that are "small enough". The 

following Lemmas 13.31 and 13.61 are key to our algorithm. We also need two intermediate 

technical results, which will be used in the proof of Lemma 13.61 

Lemma 3.3. There is a function f{c) such that if {G,S,k) is an {S,c) -reduced Yes- 
instance to the p-c-DiSJOiNT Cover problem, then |V>2| < /(c) • k. 

Proof. In order to upper-bound |^>2|5 we build from G[S\ the following auxiliary graph 
H: we start with H = {S,E{G[S])), and for each vertex v G V>2 with neighbors 
ui, . . . ,U£, £ > 2, we add to H an edge e„ between two arbitrary neighbors ui, U2 of v. 
Note that H G, and that for each vertex v E V>2, H\ey :<m G\v. We now argue 
that \E{H)\ is linearly bounded by k, which implies the desired result as by construction 
|V>2| < \E{H)\. If G is a YES-instance, there must be a set S' C V\S, \S'\ < k, such that 
G\S' is c-pumpkin-free. By construction of H, the removal of each vertex v £ S' Ci V>2 
corresponds to the removal of the edge Cy S E{H). Let H' = {H \ e^ : v £ S' Ci V>2}, 
and note that H' <rn G \ S' , so H' is c-pumpkin-free. Therefore, by Corollary 12.41 
it follows that \E{H')\ < (c - 1) • (2c - 1) • \V{H')\ < (c - 1) • (2c - 1) ■ {k + 1). As 
\E{H)\ < weconcludethat |V>2| < \E{H)\ < {c-l)-{2c-l)-{k + l) + k. □ 

Lemma 3.4. There is a function g[c) such that if{G,S,k) is an {S,c)-reduced YES- 
instance to the p-c-DiSJOiNT Cover problem and C is a collection of disjoint connected 
subgraphs of G[V \ S] such that each subgraph has at least two distinct neighbors in S, 
then \C\ < g{c) ■ k. 

Proof. The proof is very similar to the proof of Lemma [3.3l In order to upper-bound 
we build from G[S] the following auxiliary graph H: we start with H = {S,E{G[S])), 
and for each subgraph G £ C with neighbors ui, . . . ,ui, £ > 2, we add to H an edge 
ec between two arbitrary neighbors ui,U2 of G. Note that H <rn G, and that for each 
subgraph G £ C, H \ ec G \ G. We now argue that \E{H)\ is linearly bounded 
by k, which implies the desired result as by construction \C\ < \E{H)\. If G is a YES- 
instance, there must be a set S' O V \ S, \S'\ < k, such that G\ S' is c-pumpkin-free. 
By construction of H, the removal of a vertex f in a subgraph G £ C corresponds to the 
removal of at most an edge ec £ E{H) (as maybe the edge ec can still be simulated after 
the removal of v). Let H' be the subgraph obtained from H after the removal of those 
edges, and note that H' <rn G\S', so H' is c-pumpkin-free. Therefore, by Corollarv 12.41 
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it follows that \E{H')\ < (c - 1) • (2c - 1) • \V{H')\ < (c - 1) • (2c - 1) • (A; + 1). As 
\E{H)\ < \E{H')\+k, we conclude that \C\ < \E{H)\ < (c- 1) • (2c- 1) • (A; + 1) + fc. □ 

Lemma 3.5. In an {S , c) -reduced instance {G,S,k), the number of connected compo- 
nents ofG[V\S] is 0{k). 

Proof. Note that by reduction rules Rl and R3, we can assume that each connected 
component C of G[V\S'] has some neighbor in 5", and that if C has exactly one neighbor 
V in S, then G[y(C)U{t;}] has a c-pumpkin. We now distinguish two types of connected 
components of G[y \ 5]: 

• The number of components C that have exactly one neighbor vvaS and such that 
G\y{C) U {u}] contains the c-pumpkin as a minor is at most A;, as any solution 
needs to contain at least one vertex from each such connected component. 

• The number of components that have at least two neighbors in S is 0{k) by 
Lemma 13.41 

This concludes the proof. □ 

Now we prove our key structural lemma. 

Lemma 3.6. There is a function h[c) such that if{G,S,k) is an (S, c)-reduced Yes- 
instance to the p-c-Dis JOINT Cover problem, then either \Vi\ < h{c) ■ k, or we can 
apply protrusion rule P, or we can branch according to branching rule B. 

Proof. We proceed to find a packing of disjoint connected subgraphs V = {-Bi, . . . , -B^} 
of G\V \ S] containing all white vertices except for 0{k) of them. This will help us 
in bounding \Vi\. We call the subgraphs in V blocks. For a graph H C G\V \ S\, let 
w{H) be the number of white vertices in H. The idea is to obtain, as far as possible, 
blocks B with c < w{B) < c^; we call these blocks suitable, and the other blocks are 
called non-suitable. If at some point we cannot refine the packing anymore in order to 
obtain suitable blocks, we will argue about its structural properties, that will allow us to 
either bound the number of white vertices, or to apply protrusion rule P, or to branch 
according to branching rule B. 

We start with V containing all the connected components C of G[y \ 5"] such that 
w{C) > c^, and we recursively try to refine the current packing. By Lemma 13. 5| we 
know that the number of connected components is 0{k), and hence the number of white 
vertices, that are not included in V is 0{c^k) = 0{k). 

More precisely, for each block B with w{B) > c^, we build a spanning tree T of B, 
and we orient each edge e € E{T) towards the components of T\{e} containing at least 
c white vertices. (Note that, as w{B) > c^, each edge gets at least one orientation, and 
that edges may be oriented in both directions.) If some edge e G E{T) is oriented in 
both directions, we replace in V block B with the subgraphs induced by the vertices 
in each of the two subtrees. We stop this recursive procedure whenever we cannot find 
more suitable blocks using this orientation trick. Let V be the current packing. 

Now let i? be a non-suitable block in V, that is, w{B) > c^ and no edge of its spanning 
tree T is oriented in both directions. This implies that there exists a vertex v S ViT) 
with all its incident edges pointing towards it. We call such a vertex v a sink. Let 
Ti, . . . ,Tp he the connected components of T \ {v}. Note that as is a sink, w(Ti) < c 
for 1 < i < p, using the fact that w{B) > c^ we conclude that p > c^. Now let Pi, . . . , 
be the connected components of G[V{Ti) U ••• U V{Tp)] = G[V{B) \ {v}], and note 
that i < p. We call these subgraphs Pi the pieces of the non-suitable block B. For 
each non-suitable block, we delete the pieces with no white vertex. This completes the 
construction of V. The next claim bounds the number of white vertices in each piece of 
a non-suitable block in V. 



10 



G. JORET, C. PAUL, I. SAU, S. SAURABH, AND S. THOMASSE 



Claim 1. Each of the pieces of a non-suitable block contains less than (? white vertices. 

Proof. Assume for contradiction that there exists a piece P of a non-suitable block 
with w{P) > (? , and let v be the sink of the non-suitable block obtained from tree T. 
By construction V{P) is the union of the vertices in some of the trees in T \ {?;}; let 
without loss of generality these trees he Ti, . . . ,Tq. As w{P) > (? and w{Ti) < c for 
1 < ^ < 9, it follows that q > c. As v has at least one neighbor in each of the trees 
Ti, 1 < i < q, and P is a connected subgraph of G, we can obtain a c-pumpkin-model 
{A,B} in G[V \ S] by setting A := {v} and B := V{P), contradicting the fact that 
G[V \ S] is c-pumpkin-free. □ 

Hence in the packing V we are left with a set of suitable blocks with at most white 
vertices each, and a set of non-suitable blocks, each one broken up into pieces linked 
by a sink in a star-like structure. By Claim [H each piece of the remaining non-suitable 
blocks contains at most white vertices. 

Now we need two of claims about the properties of the constructed packing. 

Claim 2. In the packing V constructed above, the number of suitable or non-suitable 
blocks is 0{k). 

Proof. We first bound the number of suitable blocks and for this we distinguish between 
two types of suitable blocks: 

• Type 1: blocks that have exactly one neighbor in S. For each such block we 
need to include a vertex of it in the c-hitting set (as each suitable block contains 
at least c white vertices), so obviously their number is at most k. 

• Type 2: blocks that have at least two distinct neighbors in S. The number of 
such blocks is 0{k) by Lemma |3.4[ 

The proof for non-suitable blocks is similar. We distinguishing between the same two 
types of blocks, and use the fact that each non-suitable block contains at least c' > c 
white vertices. This concludes the proof. □ 

Claim 3. In an {S,c) -reduced instance, either the total number of pieces in the packing 
V constructed above is 0{k) or we can branch according to branching rule B. 

Proof. We first distinguish between three types of pieces: 

• Type 1: pieces that have at least two distinct neighbors in S. The number of 
pieces of this type is 0{k) by Lemma 13.41 

• Type 2: pieces that are not of type 1 and that have at least one neighbor in some 
suitable block or in another non-suitable block (note that by construction a piece 
cannot have any neighbor in other pieces in the same non-suitable block). We 
construct an auxiliary graph H as follows: we start with the packing V, and we 
add all the edges in G\V\S] between vertices in different blocks oiV (suitable or 
non-suitable). Then we contract each block to a single vertex, and let H be the 
resulting graph. By Claim [3 \V{H)\ = 0{k). As H <rn G[V \ S] and G[V \ S] 
is c-pumpkin-free, by Corollarv 12.41 we have that \E{II)\ = 0{k). The bound 
follows from the fact the number of pieces of Type 2 is at most 2\E[II)\, as each 
piece is incident with at least one edge of H after uncontracting the blocks. 

• Type 3: the remaining pieces. That is, these are pieces P that see exactly one 
vertex u in 5, and that are connected to the rest of G'[y \ S] only through the 
corresponding sink v. In other words, such a piece P is a connected component 
of G \ {u, v}. Let 

a{P) := max{c' : G[V{P) U {u}] has a c'-pumpkin-minor}. 

We distinguish two cases: 
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o If a{P) > c, then any c-hitting set needs to contain at least one vertex in 
P. We conclude that the number of pieces of this subtype is at most k. 

o Otherwise, a{P) < c. Now we only focus on pieces of this subtype. We 
again distinguish two cases. 

• Assume first that for some sink v, there are two pieces Pi and P2 in its 
block that have neighbors in distinct connected components of G[S], 
say u and w respectively. If |Pi U P2I is small, then we can branch 
according to branching rule B. Otherwise we apply protrusion rule 
P to reduce the size of the Pi to 0{1). Observe that Pi contains at 
most c' < c white vertices. As tw(Pj U {v}) < 2c and \d{Pi U {v})\ < 
c' + 1, Pj U {v} is a 2c-protrusion. Now we can replace (G, S, k) by 
another equivalent graph {G' , S, k') {k' < k) such that the graph Pi is 
replaced by P[, it contains all the white vertices, and the size of P[ is 
0(1). Consider a connected components of P{, say C. If |5(C)| < 1, 
then d{C) is either v or u. In either case the conditions of reduction 
rule R2 or R3 hold and we can delete C. So either we can remove 
all connected components or there exists a connected component such 
that d{C) = {u,v}. In the first case we will get the following graph 
(G \ Pi, S, k') and hence we have made progress. If the second case 
occurs then we do as follows. 

Now we do the same with P2 U {f } as we did for Pi U {f } but on 
the initial input {G,S,k) instead of {G',S,k'). In this case either 
we can get rid of P2 and obtain an equivalent instance or we have 
a connected component G' in Pj such that d{G') = {v^w}. So in 
the case we apply protrusion rule P on Pi U {f } and P2 U {v} and 
get two connected components C and G' with d{C) = {u, v} and 
d{C') = {f,u)}, respectively, we apply the reduction rules on both 
Pi U {v} and P2 U {v}. Let {G*,S,k*) be the reduced instance and 
C and G' be the connected components as described above. Now 
the number of vertices in each connected component is 0(1). Notice 
that there exists a path P in G[G L) G' L) {v}] of constant size whose 
endpoints see different connected components of G[S]. Therefore, we 
can branch according to branching rule B. 

If we do not branch then we repeat this step, in which case we ei- 
ther branch eventually or all the pieces associated with this sink have 
neighbors in the same connected component of G[S]. 

• Otherwise, the pieces associated with each sink have neighbors in the 
same connected component of G[S]. We finally distinguish two more 
cases. 

o Assume that in some non-suitable block with sink v there are 
more than c pieces connected to the same connected component 
of G[S]. As a(P) < c and because we are dealing with pieces 
of Type 3, all the conditions of reduction rule R4 are fulfilled 
and therefore vertex v can be safely included in the solution. 
Therefore, this case does not occur in an {S, c)-reduced instance. 

o Otherwise, the number of pieces of each non-suitable block is at 
most c. As each piece contains at most white vertices and the 
total number of non-suitable blocks is 0{k) by Claim [21 the total 
number of white vertices in non-suitable blocks of this subtype 
is Oik). 
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So when we do not branch the number of pieces is 0{k), concluding the proof. □ 

Note that the proof of Claim [3] shows that if the number of pieces is not 0{k), then 
we can apply protrusion rule P (and possibly reduction rules R2 and R3) to eventually 
branch according to branching rule B. 

To conclude, recall that the constructed packing V contains all but 0{k) white ver- 
tices, either in suitable blocks or in pieces of non-suitable blocks. As by construction 
each suitable block has at most white vertices and by Claim [5] the number of such 
blocks is 0{k), it follows that the number of white vertices contained in suitable blocks 
is 0{k). Similarly, by Claim [1] each piece contains at most (? white vertices, and the 
total number of pieces is 0{k) by Claim [3] unless we can branch according to branch- 
ing rule B, so if we cannot branch the number of white vertices contained in pieces of 
non-suitable blocks is also 0{k). □ 

Final algorithm. Finally we combine everything to obtain the following result. 

Theorem 3.7. For any fixed c > 1, the p-c-PUMPKlN-CovERiNG problem can he solved 
in time 2^^^^ ■ n'^^^). 

Proof. To obtain the desired result, by Lemma [3. II it is sufficient to obtain an algorithm 
for p-c-DiSJOiNT Cover. Recah the measure /x = cc{G[S]) + k. Now by Lemma [3^ 
either we branch according to branching rule B, or we can apply protrusion rule P and 
get a smaller instance, or we have that |Vi| < h{c) ■ k. In the first case we branch into 
a(c) ways and in each branch the measure decreases by one as either we include a vertex 
in our potential solution or we decrease the number of connected components of G[S]. 
Here a{c) is a constant that only depends on c. This implies that the number of nodes 
in the branching tree is upper-bounded by a{c)^ < a{c)'^^~^^ = 2^^^\ In the second 
case, we get a smaller instance in polynomial time. Finally, in the third case, using 
Lemma 13.31 we get that |A = Vi U V>2\ = 0{k). Thus using Lemma 13.21 we can get an 
equivalent instance (G*, 5, k*) with 0{k) vertices and hence the problem can be solved 
by enumerating all subsets of size at most k* of G* \ S. This concludes the proof. □ 

4. An approximation algorithm for covering and packing pumpkins 

In this section we show that every n- vertex graph G either contains a small c-pumpkin- 
model or has a structure that can be reduced, giving a smaller equivalent instance for 
both the Minimum c-Pumpkin-Covering and the Maximum c-Pumpkin-Packing 
problems. Here by a "small" c-pumpkin- model, we mean a model of size at most /(c) • 
logn for some function / independent of n. We finally use this result to derive a 
©(log n)-approximation algorithm for both problems. 

This section is organized as follows. We first describe in Section BT] our reduction rules 
and prove their validity for both covering and packing problems (see Lemma l4.ip . The 
existence of small c-pumpkin- models in c-reduced graphs is provided in Section 14.31 (see 
Lemma [4.1ip : its proof strongly relies on a graph structure that we call hedgehog^ which 
we study in Section [4.21 We finally focus in Section [4.41 on the algorithmic consequences 
of our results (see Theorems 14.121 and I4.13P . 

4.1. Reduction rules. We describe two reduction rules for hitting/packing c-pumpkin- 
models, which given an input graph G satisfying some specific conditions, produce a 
graph H with less vertices than G and satisfying Tc{G) = Tc{H) and vdG) = Vc{H). 
Moreover, these operations are defined in such a way that, for both problems, an optimal 
(approximate) solution for G can be retrieved in polynomial time from an optimal (resp. 
approximate) solution for H. 

An outgrowth of a graph G is a triple (C, u, v) such that 
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Figure 2. The graph r{C,u,v) of two outgrowths {C,u,v). We have 
A(C, u,v) = 8 < 9 = 7(C, u, v) m the left one, while A(C, u,v) = 7 > 5 = 
7(C, u, v) holds for the right one. 




Figure 3. Illustration of reduction rule Z2 on the two outgrowths from Fig. [5J 

(i) u, V are two distinct vertices of G; 

(ii) C is a connected component of G\ {u,v} with \C\ > 2; and 

(iii) u and v both have at least one neighbor in C in the graph G. 

Given an outgrowth (C, u, v) of a graph G, we let r(C, u, v) denote the subgraph of 
G induced by V{G) U {u, v} where all the edges between u and v are removed. We let 
A(C, u, v) be the graph r(C, n, f ) where an edge between u and has been added. Also, 
we define 7(C, u, v) as the largest integer k such that r(C, it, i)) has a /c-pumpkin-model 
i?} with u ^ A and v £ B, and A(C, it, as the largest integer k such that A(C, u, v) 
has a A:-pumpkin-model {A,B} with it,?; € A. See Fig. [2] for an illustration. 

Now that we are equipped with these definitions and notations, we may describe the 
two reduction rules, which depend on the value of the positive integer c. 

Zl Suppose i; is a vertex of G is such that no block of G containing v has a c- 
pumpkin- minor. Then define H as the graph obtained from G by removing 

V. 

Z2 Suppose (C, u, v) is an outgrowth of G such that r(C, n, v) has no c-pumpkin- 
minor. Assume further that Zl cannot be applied on G. Let 7 := 7(C, n, i;) 
and A := X{C,u,v). If A < 7, define H as the graph obtained from G \ V{G) 
by adding 7 parallel edges between u and v. Otherwise, define H as the graph 
obtained from G \ V{G) by adding a new vertex vc and linking vc to u with 7 
parallel edges, and to v with A — 7 parallel edges. 

See Fig. [3] for an illustration of Z2. We note that testing for the existence of a k- 
pumpkin-model {^4, B} in a graph G can be done in polynomial time when k is a fixed 
constant, and this remains true if we moreover require u £ A and/or v £ B, where 
u, V are two specified vertices of G. This follows from classical results of Robertson and 
Seymour [26]. In particular, the parameters ^{GjUjv) and X{C,u,v) can be computed 
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in polynomial time when C has no c-pumpkin minoiQ- Therefore, the reduction rule Z2 
can be applied in polynomial time. 

A graph G is said to be c-reduced if neither Zl nor Z2 can be applied to G. The next 
lemma shows the validity of these reduction rules. 

Lemma 4.1. Let c be a fixed positive integer. Suppose that H results from the applica- 
tion of Zl or Z2 on a graph G. Then 

(a) Tc{G) = Tc{H) and moreover, given a c-hitting set X' of H, one can compute in 
polynomial time a c-hitting set X of G with \X\ < \X'\. 

(b) Vc{G) = T^c{H) and moreover, given a c-packing Ai' of H , one can compute in 
polynomial time a c-packing Ai of G with = \M'\. 

In order to prove Lemma l4.ll we first need to introduce a few technical lemmas; the 
validity of the reduction rules is shown in Lemmas 14.51 and 14.61 at the end of this section, 
which correspond to Lemma k.lT a) and Lemma k.lT b). respectively. 

Lemma 4.2. Let (C, u, v) be an outgrowth of a graph G, and suppose that Zl cannot 
be applied on G. Then X{C,u,v) < 2j{G,u,v). 

Proof. Let 7 := ^{G,u,v), T := T{G,u,v), A := X{G,u,v), and A := A{G,u,v). Let 
{A, B} be a A-pumpkin-model in A with u,v G A, and such that B is inclusion-wise 
maximal with that property. Observe that if there is some vertex x in V{A) \ A that 
is adjacent to some vertex in B in the graph A, then we could have added x to B, 
which contradicts its maximality. Hence, every vertex outside B that is adjacent to 
some vertex in B is included in A. We will use this fact later in the proof. 

If A is not 2-connected, then A has a cutvertex x (since |A| > 3|). The vertex x 
cannot be u nor v, because A — u and A — -y are both connected. Also, the two vertices 
u, V are in the same block of A (because of the edge uv). It follows that there is a block 
X of A that includes x but avoids u and v. Then K is also a block of G, and we could 
have applied Zl on any vertex in V{K) \ {x}, a contradiction. Therefore, A must be 
2-connected. 

Let us a call a vertex in V{A) \ B that is adjacent to at least one vertex in B simply 
a neighbor of B. Since A is 2-connected, B has at least two neighbors. Also, as we have 
seen previously, every neighbor of S is in ^. 

For every two distinct neighbors x,y of B, there are two vertex-disjoint paths Pi, 
P2 between {u, v} and {x, y} in A, since the graph is 2-connected. Let us choose such 
vertices x,y and paths Pi,P2 in such a way that + |i-*2| is minimized. Exchanging 
Pi with P2 if necessary, we may assume u G V{Pi) and v G V{P2). It follows from our 
choice of x, y, Pi, P2 that V{Pi) n 5 = and V{P2) n P = 0. 

The paths Pi and P2 also exist in the graph F, since they cannot use the edge uv. 
Let us build two disjoint sets Su and of vertices iteratively as follows. First let 
Su '■= ^(Pi) and Sy := V{P2). Then, for every neighbor z of P that is distinct from x 
and y, consider a shortest path Q between Su U Sy and {z} in T[A]. (Such a path exists 
since z £ A, and F[^] is either connected, or has two connected components and u,v 
are in distinct components.) Then V{Q) has a non-empty intersection with either Su or 
Sy (but not both); add all vertices of V{Q) to Su in the first case, to Sy in the second 
case. 

By definition, Su, Sy, and B are mutually disjoint, the two graphs T[Su] and T[Sy] 
are connected, and every neighbor of B is in either Su or Sy. Thus, letting p and q be 
the number of Sy-B edges and Sy-B edges in F, respectively, we have p + q = X. On the 



We note that A(C, u, v) can equivalently be defined as the largest integer k such that A(C, u, v) /uv 
has a fc-pumpkin-model {A, B} with u £ A. 
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other hand, {Su, B U 5^,} is a p-pumpkin-model in T with u G Su, v £ B U Sy implying 
7 > p. Similarly, {Su U B,Sv} is a g-pumpkin-model in F with u&Su^B,v£B, 
showing 7 > (?. Therefore, A = p + g < 27, as claimed. □ 

Lemma 4.3. Let c be a fixed positive integer. Suppose H is obtained by applying rule 
Z2 on an outgrowth (C, u, v) of a graph G. Let X be an arbitrary subset of vertices 
ofV{G) \ {y{C) U {u, u}). Then, given a c-pumpkin-model of H\X, one can find in 
polynomial time a c-pumpkin-model of G\X . 

Proof. Let V := V{C,u,v), A := A(C, u, u), 7 := 7(C,n,-y) and A := A(C,n,-y). Let 
{A,B} denote the given c-pumpkin-model of I{\X. We may assume that this model 
is minimal (if not, one can obviously make it minimal in polynomial time). 

Case 1: A < 7. 11 u ^ Au B or v ^ Au B, then {A, 5} is a c-pumpkin-model in 
G\X and we are done. Thus, exchanging A and B if necessary, we may assume that 
either u,v ^ A, or u ^ A and v £ B. In the first case, since G[A U ^(C)] is connected, 
{A U V{G),B} is a c-pumpkin-model in G\X. In the second case, A and B both 
induce connected subgraphs of G, and there are 7 extra edges between A and B in H 
compared to G. Let {A' , B'} be a 7-pumpkin- model in T with u £ A' and v £ B' . Then 
{A U A' ,B U B'} is a c-pumpkin-model in G\X, as desired. 

Case 2: A>7-|-1. li u ^ AU B or v ^ Au B , then by minimality {A, B} also avoids 
the vertex vc, and {A, B} is a c-pumpkin-model in Thus again, exchanging A and 

B if necessary, we may assume that either u,v £ A, or u £ A and v £ B. Suppose the 
former holds. If vc ^ B then setting A' := {A\{vc}) U 1^(C), we deduce that {A', B} is 
a c-pumpkin-model in G\X. (Note that this is true independently of whether vc £ A 
or not.) If Vc £ B, then B = {vc} and the number of edges between A and B in 
-ff is 7 -|- (A — 7) = A. Hence, A > c. Letting {A',B'} be a A-pumpkin-model in A 
with u,v £ A', we have that G[A U A'] is connected. It follows that {^4 U A',B'} is a 
c-pumpkin-model in G\X. 

Now assume u £ A and v £ B. By Lemma [4.2l we have A < 27, and thus A — 7 < 7. If 
vc ^ AuB, then {A, B} is a c-pumpkin-model of G\X. If vc £ AuB, then the number 
of edges in H between A and B that are incident with vq is at most max{7, A — 7} =7. 
Hence, letting {A' , B'} be a 7-pumpkin-model of F with u £ A! and v £ B' , we deduce 
that {^4 LlA',BLI B'} is a c-pumpkin-model in G\X. 

Finally, we note that each step of the above proof can be realized in polynomial time. 
This follows from the fact that the parameters 7 and A and the associated models can 
be computed in polynomial time. □ 

Next we show that the converse of the above lemma also holds. 

Lemma 4.4. Let c,G,H,{G,u,v), and X be as in Lemma \4-3[ Then, given a c- 
pumpkin-model of G\X , one can find in polynomial time a c-pumpkin-model of H\X . 

Proof. Let F := T{C,u,v), A := A{C,u,v), 7 := ^{G,u,v) and A := X{G,u,v). Let 
{A, B} denote the given c-pumpkin-model of G\X. We may assume that this model is 
minimal. 

Case 1: A < 7. U u ^ AU B 01 v ^ AU B, then A,Bn V{G) = (by minimality 
of {A,B}) and thus {A,B} is a c-pumpkin-model in H \X. Thus, exchanging A and 
B if necessary, we may assume that either u,v £ A, or u £ A and v £ B. First suppose 
the former holds. If B has no vertex in C, then {A \ V{G),B} is a c-pumpkin-model 
in H \ X. If B contains some vertex of G then B C V{G). Since {A n V{A), B} is a 
c-pumpkin-model in A with u,v £ Ar]V{A), we have A > c. On the other hand, we have 
7 < c, since F has no c-pumpkin-minor. It follows that c<A<7<c, a contradiction. 
Thus B cannot contain a vertex of C, which concludes the case where u,v £ A. 
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Now assume u G A and v £ B. In the graph F, the number of edges between 74ny(r) 
and B n V^(r) is at most 7 (by definition of 7). Since there are 7 extra uv edges in H 
compared to G, we deduce that {A \ V{C), B \ V{C)} is a c-pumpkin-model in H \ X, 
as desired. 

Case 2: A > 7 + 1. For the same reasons as in the A < 7 case, we must have 
u,v 6 AL) B, and thus we may assume that either u,v £ A, or u £ A and v £ B. 
Suppose the former holds. If B does contain any vertex in C then {{A\V{C))L){vc},B} 
is a c-pumpkin-model oi H \ X. If B contains some vertex in C then B C V[C), and 
{An V{A),B} is a c-pumpkin-model of A with u,v £ Af] V{A), implying A > c, and 
that {A \ V{C), {vc}} is a c-pumpkin-model oi H \ X. 

Now assume u £ A and v £ B. In F, there are at most 7 edges between AriV{T) and 
B n V{r). Thus {A \ V{C), {B \ V{C)) U {vc}} is a c-pumpkin-model in \ X. □ 

Lemma 4.5. Let c he a fixed positive integer. Suppose H results from the application 
of Zil or Z2 on a graph G. Then Tc{G) = Tc{H). Moreover, given a c-hitting set X' of 
H one can compute in polynomial time a c-hitting set X of G with \X\ < \X'\. 

This lemma implies that an optimal solution to the Minimum c-Pumpkin-Covering 
problem on G can be computed given one for H, and similarly that an approximate 
solution for G can be obtained from an approximate solution for H. This will be used 
in our approximation algorithms in Section 14.41 

Proof of Lemma \4.5\ First suppose H results from the application of Zl on G with 
vertex v. We trivially have Tc{G) > Tc{H). Let X' be a given c-hitting set of H. If X' 
is not a c-hitting set of G, then G\X' has a c-pumpkin-model; let {A, B} be a minimal 
one. We have v £ Au B since otherwise {^4, B} would be a c-pumpkin-model in H\X' . 
By the minimality of {A, B}, we must have AU B CI V{K) for some block K of G. But 
then X is a block of G including v and containing a c-pumpkin-minor, contradicting the 
assumptions of Zl. Therefore X := X' is a c-hitting set of G, and Tc{G) < Tc{H) also 
holds, implying Tc(G) = Tc{H). 

Now assume H has been obtained by applying Z2 on G with outgrowth (G, u, v). Let 
F := F(G, n, v), A := A(G, u,v), 7 := 7(G, u, v) and A := A(G, u, v). 

Observe that every minimal c-pumpkin-model of G containing a vertex of G must 
contain both u and v. This is because F has no c-pumpkin-minor. We also note that, in 
order to cover all c-pumpkin-models of G, it is enough to cover those that are minimal. 

First we show Tc(G) > Tc{H). Let X be a minimum c-hitting set of G. \{ u £ X or 

G X, then X is trivially a c-hitting set of H, so let us assume u,v ^ X. Moreover, we 
may suppose that X has no vertex in G, since otherwise we could replace all such vertices 
with the vertex u (or equivalently v). Since X C V{G) \ {V{C) U {u, v}) and G\X has 
no c-pumpkin-minor, it follows from Lemma 14.31 that H \ X has no c-pumpkin-minor 
either, that is, X is also a c-hitting set of H. This shows Tc{G) > Tc{H). 

Now we prove that Tc{G) < Tc{H) also holds. Here we show that, given a c-hitting 
set X' of H, one can find in polynomial time a c-hitting set X of G with \X\ < \X'\. 
Hence, this will also prove the second part of the lemma. 

First, if A > 7 and vc £ X' , we replace vc in X' by one of u,v; the modified set 
X' clearly remains a c-hitting set of H, and its size did not increase (note that it could 
decrease if u and v were already in X'). Thus we may assume X' C V{G) \ {V{C) U 
{u,v}). 

If u £ X' or V £ X' , then X := X' is also a c-hitting set of G, since F has no 
c-pumpkin-minor. If u,v ^ X' , then Lemma 14.41 implies that G\X' has no c-pumpkin- 
minor. Hence, X := X' is a c-hitting set of G. This shows Tc{G) > Tc{H), and therefore 
r,(G) = T,{H). □ 
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We conclude this section with a lemma similar to Lemma 14.51 for c-packings. 

Lemma 4.6. Let c be a fixed positive integer. Suppose H results from the application 
of Zl or Z2 on a graph G. Then I'dG) = Vc{H). Moreover, given a c-packing M' of 
H one can compute in polynomial time a c-packing M of G with \M\ = 

Proof. First suppose H results from the application of Zl on G with vertex v. Clearly, 
every c-packing of is a c-packing for G. Thus Uc{G) > udH), and it is enough to 
show the reverse inequality. Consider a c-packing of G. We may assume that every 
c-pumpkin-model in that packing is minimal. Thus each such model is contained in 
some block of G, and hence avoids the vertex v. Therefore the packing also exists in H, 
implying z/c(G) < ^dH) and fc(G) = Uc{H), as desired. 

Now assume H has been obtained by applying Z2 on G with outgrowth (C, u, v). Let 
7 := 7(C, u, v) and A := A(C, u, v). 

First we show vdG) > fc(-ff). Let M' = {M[, ... , M(} be a given c-packing of H. We 
show that a packing of the same size in G can be computed in polynomial time, which 
will prove the second part of the lemma. We may assume that each AI- is minimal (if 
not, they can be made minimal in polynomial time). Thus if A > 7 and M| contains the 
vertex vc then M- also includes both u and v. It follows that if every M- avoids at least 
one of u, V then the packing Ai := Ai' is a c-packing in G and we are done. So assume 
one model in the collection, say without loss of generality M{, includes both u and v. 
Let X be the union of the vertices in M2 , ■ ■ ■ , M'^ . Since M[ is a c-pumpkin-model in 
H\X, using Lemma 14.31 we can compute in polynomial time a c-pumpkin-model Mi in 
G\X. Hence A4 := {Mi, Mg, . . . , Mj^} is c-packing of the desired size in G. 

In order to prove vd^) — ^d^) it remains to show ud^) — ^d^)- Let {Mi, . . . , M^} 
be a c-packing of G. Again we may assume that each Mj is minimal. Thus if some Mj 
contains some vertex of G then Mj contains both u and v. If there is no such model in 
the packing then {Mi, . . . , M^} is also of c-packing of H and we are done. We may thus 
assume that some model in the packing, say without loss of generality Mi, contains both 
u and V. As before, let X be the union of the vertices in M2, . . . , M^. Using Lemma [4.41 
with Ml we find a c-pumpkin-model M( m. H\X. Thus {M{, M2, . . . , M^} is a c-packing 
of size k in as desired. □ 

4.2. Hedgehogs. A graph is said to be a multipath if its underlying simple graph is 
isomorphic to a path. If P is a multipath and u,v £ V{P), we write uPv for the 
subgraph of P induced by the vertices on a u-v path in P (thus edges in uPv have the 
same multiplicities as in P). 

A hedgehog is a pair {H, P), where H is a graph and P is an induced multipath of H 
with |P| > 2 and such that 

(i) the (possibly empty) set S := V{H) \ V{P) is a stable set of H; and 

(ii) every vertex in S has at least two neighbors in P. 

(Let us recall that a stable set is a set of vertices such that no two of them are adjacent.) 

Consider a hedgehog {H, P). Its size is defined as |P|, the number of vertices in P. A 
bad cutset of {H, P) is a set X = {u, v} of two internal vertices of P such that H\X has 
a connected component G avoiding both endpoints of P with \C\ > 2. This definition is 
motivated by reduction rule Z2: if C is such a component, then (C, u, v) is an outgrowth 
of H. A rooted c-pumpkin-model of {H, P) is a c-pumpkin-model {A, B} of H with the 
extra property that A and B both contain an endpoint of P. 

Given a hedgehog {H,P) and a connected induced subgraph Q oi P with \Q\ > 2, 
one can define a hedgehog {H',Q) as follows: First, remove from H every vertex not 
in P that has no neighbor in Q. Then contract every edge of P not included in Q. 
Finally, remove from the graph every vertex not in Q that has only one neighbor in Q. 
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Figure 4. A hedgehog {H, P) (left) and a contraction {H', Q) of {H, P) 
(right). The multipaths P and Q are drawn in bold. 

This defines the graph H' . We leave it to the reader to check that {H',Q) is indeed 
a hedgehog; we say that {H',Q) is the contraction of {H,P) on the multipath Q. See 
Fig. m for an illustration of this operation. The following lemma is a direct consequence 
of the definition. 

Lemma 4.7. If {H' , Q) is a contraction of a hedgehog (H, P) and X is a bad cutset of 
{H',Q), then X is also a bad cutset of {H,P). 

We show that every big enough hedgehog has a rooted c-pumpkin-model or a bad 
cutset. This fact will be useful in the subsequent proofs. 

Lemma 4.8. Let c be a fixed positive integer. Then every hedgehog {H, P) of size at 
least (Ac)'^'^ contains a rooted c-pumpkin-model or a bad cutset, either of which can be 
found in polynomial time. 

Proof. The proof is by induction on c. The base case c = 1 is trivial since P directly 
gives a rooted 1-pumpkin-model. For the inductive step, assume c > 1. Define f{k), for 
a positive integer k, as f{k) := (4^)'*'^. Let S := V{H)\V{P). Let a,b be the endpoints 
of P. 

If a vertex v £ S has at least c neighbors in P, then let w be the neighbor of v that is 
closest to a on P. Then A := V{aPw)L){v} and B := V{P)\A both induce a connected 
subgraph of H. Moreover, there are at least c — 1 edges from v to B, and at least one 
from A \ {v} to B (because of P). Since a £ A and b £ B, we deduce that {A,B} is a 
rooted c-pumpkin-model of {H, P). Thus we may assume that every vertex in S has at 
most c — 1 neighbors in P. In particular we have c > 3, since every vertex in S has at 
least two neighbors in P. 

The multipath P, seen from its endpoint a, induces a natural linear ordering of the 
neighbors of a given vertex in S; we say that two such neighbors are consecutive if they 
are consecutive in that ordering. 

Suppose that there exists a vertex v £ S with two consecutive neighbors x, y such 
that > f{c — 1) -|- 2. Consider the contraction {H',Q) of {H,P) on the multipath 

Q := xPy \ {x,y}. Since \Q\ > f{c — 1), by induction {H',Q) has a rooted (c — 1)- 
pumpkin-model {A',B'} or a bad cutset X. If the latter holds, then by Lemma 14.71 
the set X is also a bad cutset of {H, P) and we are done. Thus we may assume the 
former holds. In the graph H, the vertex v has no neighbor in Q, thus v is not included 
in H' . Hence, we can obtain a rooted c-pumpkin-model {A,B} in {H,P) by setting 
A := A' L)V {aPx)L) {v} and B := B' L)V{yPb). Therefore we can assume that, for every 
vertex v £ S, every two consecutive neighbors of v are at distance at most /(c — 1) on 
P. 

Let us enumerate the vertices of P in order as pi,p2, ■■■ ,Pk, with pi = a and pk = b. 
We may assume that, for every i £ {2, . . . ,k — 4}, 

(2) at least one of pj+i, pi+2 is adjacent to some vertex in 5*. 

Indeed, if not then {pi,Pi+3} would be a bad cutset of {H,P). Since A; = |P| > /(c) > 
/(3) > 5, this implies in particular that S is not empty. 
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Define an open interval 1^ = for every vertex v £ S, where i (j) is the smaUest 
(largest, respectively) index t such that pt is a neighbor v in H. (Observe that i < j 
since v has at least two neighbors.) Now, let G be the interval graph defined by these 
open intervals, that is, let V{G) := S, and for every two distinct vertices VjW £ S, make 

V adjacent to -u; in G if and only if n / 0. 

For a connected subgraph G' of G, we define I{G') as the union of the intervals of 
vertices in G', that is, I{G') := |J{/„ : v G V{G')}. Observe that, since G' is connected, 
we have I{G') = for some integers i,j with 1 < i < j < k. 

First suppose that G has at least five connected components. The ordering pi, . . . ,pk 
of the vertices of P induces an ordering of these components; let G, G' , G" be three 
consecutive connected components in that ordering, with the property that G is not first 
and G" is not last in the ordering. Let := I{G), := I{G'), and := 

I{G"). Then we have 1 < i < j < i' < j' < i" < j" < k, and every vertex of S that 
is adjacent to some vertex in {pi+i, ■ ■ ■ ,pj'i-i} has all its neighbors in that set. Since 
/' > i + 3, it follows that {pi,Pj"} is a bad cutset of {H,P). Hence, we may assume 
that G has at most four connected components. 

A vertex pt {t £ {1, ... ,k}) is said to be exposed if there is no connected component 
C of G with interval I{G) = such that i <t < j. If G has q connected components, 
it follows from ([2]) that at most q + f) vertices of P are exposed. Since 9 < 4, it follows 
that G has a connected component G such that /(G) = (x, y) with 

-9 /(c) -9 

Let Q := PxPpy and let {H' , Q) be the contraction of {H, P) on Q. We will show that 
{H', Q) contains a rooted c-pumpkin-model. The lemma will then follow, since such a 
model can be extended straightforwardly to one of {H, P). 

First let us observe that H' is an induced subgraph of H (by our choice of Q). Let 
S' := V{H') \ V{Q) = V{G). For a vertex v £ S' , let us denote by l{v) and r{v) the two 
integers such that = {l{v),r{v)). 

It follows from our assumptions on {H,P) that, for every vertex v £ S' , the vertex 

V has at most c — 1 neighbors in Q and every two consecutive neighbors of v are at 
distance at most /(c — 1) on Q. This implies 

(4) ^(^)_/(^) < (c-2)/(c-l) 
for every v £ S' . 

In H' , the vertices px and py each have at least one neighbor in S' . Let v £ S' he 
a neighbor of Px maximizing r{v), and let w £ S' he a neighbor oi Py minimizing l{w). 
Let Z he a shortest v-w path in the interval graph G; enumerate the vertices of Z as 
zi, Z2, ■ ■ ■ , Zm with zi = V and Zm = w. 

By our choice of f , w and the fact that Z is a shortest v-w path, we have 

(5) l{zj) < l{zj+i) 

(6) r{zj) < r{zj+i) 

for every j £ {1, . . . ,m — 1}, and 

(7) r(zj) < l{zj+2) 

for every j £ {1, . . . , m — 2}. 

Since the interval I{Z) = I{G) = {x,y), it follows from ([3]) and ([3]) that 



(8) m> > /(^) - ^3 ^ {^cr - 13 

- (c-2)/(c- 1) - 4(c-2)/(c- 1) 4(c-2)(4c-4)4c-4 ^ ■ 
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Figure 5. Illustration of the coloring for d = 2. The vertices in 
{zi, . . . , Z2d} are on top, the path Q at the bottom, and each breakpoint 
Pi of Q is linked to the vertices Zj such that j £ J{pi)- 

Let d := [m/2j. Define, for i G {x,... ,y}, the set J{pi) as the set of indices j G 
{!,..., 2d} such that i G {l{zj),r{zj)}. We say that pi is a breakpoint of Q if J(pi) is not 
empty. (Thus px is a breakpoint in particular.) It is a consequence of ([5]), ([6]), and ([7|) 
that, if I J(pi)| > 1, then J{pi) = {j,j + 2} for some j G {1, . . . ,2d — 2}. In particular, 
the numbers in J{pi) always have the same parity. 

We color the vertices in V{Q) U {zi, . . . , Z2d} in black or white as follows. First, for 
every j G {1, . . . ,2d}, color Zj black if j is odd, white if j is even. Next color every 
breakpoint pi of Q with the color corresponding to the parity of the numbers in J{pi) 
(namely, black for odd and white for even). Finally, color every uncolored vertex of Q 
with the color of the closest breakpoint of Q in the direction of px- See Fig. [5] for an 
illustration of the coloring. 

Let A and B be the set of black and white vertices, respectively. By construction, 
Px £ A and py £ B, and each A,B induces a connected subgraph of H' . Moreover, 
there are 2d+l > m > c edges of Q whose endpoints received distinct colors. It follows 
that {A,B} is a rooted c-pumpkin-model of {H',Q), as desired. 

We have shown that (H, P) always has a rooted c-pumpkin-model or a bad cutset. 
Moreover, it is easily seen from the proof given above that each of these can be found 
in polynomial time. This concludes the proof of the lemma. □ 

We note that no effort has been made in order to optimize the constants in Lemma [4.8l 

4.3. Small pumpkins in reduced graphs. Our goal is to prove that every n-vertex 
c-reduced graph G has a c-pumpkin-model of size Od^ogn). We will use the following 
recent result by Fiorini et al. [16] about the existence of small minors in simple graphs 
with large average degree. 

Theorem 4.9 (Fiorini et al. [16]). There is a function h such that every n-vertex simple 
graph G with average degree at least 2* contains a Kf-model with at most h{t) ■ logn 
vertices. 

Even though the authors of [16] do not mention it explicitly, we note that the proof 
given in that paper can easily be turned into a poly-time algorithm finding such a Kt- 
model. Since a Kt-model in a graph directly gives a c-pumpkin-model of the same size 
for c = ([t/2j)^, we have the following corollary from Theorem 14. 9[ which is central in 
the proof of Lemma 14.111 

Corollary 4.10. There is a function h such that every n-vertex simple graph G with 
average degree at least 2^^+^ contains a c-pumpkin-model with at most h{c) ■ logn 
vertices. Moreover, such a model can computed in polynomial time. 

The next lemma states the existence of small c-pumpkin-models in c-reduced graphs; 
its proof strongly relies on the graph structure that we have called hedgehog (see Sec- 
tion 14. 2|) . The idea is that an hedgehog witnesses the existence of either a small c- 
pumpkin-model or an outgrowth, which is impossible in a c-reduced graph. 
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Figure 6. The sets W,V, and C in the graph G. 



Lemma 4.11. There is a function f such that every n-vertex c-reduced graph G contains 
a c-pumpkin-model of size at most f{c) -logn. Moreover, such a model can he computed 
in polynomial time. 



Proof. Let 



k :-- 

r :- 
b :-- 



{4c)^^k 



We wiU prove the lemma with / defined as 

/(c) := maxjfcrfc, 3rc • ^(c)}, 

where h is the function in Corollary 14.101 Throughout the proof, a c-pumpkin-model 
of G is said to be small if it has the required size, that is, if it has at most /(c) logn 
vertices. 

The lemma trivially holds if ^(G) > c, so we may assume ^{G) < c. Let W be the 
(possibly empty) subset of vertices of G having degree at least k. 

We build a collection V of vertex-disjoint induced subgraphs of G\W , each isomorphic 
to a multipath on r vertices. Initially, we let "P := and G' := G \ W . Then, as long as 
G' has a connected component C with diameter at least r — 1, we do the following: First, 
we consider two vertices at distance r — 1 in C and compute a shortest path Q between 
these two vertices. Note that the subgraph P of G induced by V{Q) is a multipath on 
r vertices. Next, we add P to V. Finally, we remove from G' the r vertices in P. 

When the above procedure is finished, every connected component of G' has diam- 
eter less than r — 1 and maximum degree less than k. Hence each such component 
has bounded size: at most k^ = b vertices. Let C denote the collection of connected 
components of G' . 

An illustration of the sets W, V, and C in the graph G is given in Fig. O 

If some subgraph C £ C contains a c-pumpkin-model, then the size of the model is 

at most |C| < 6 < /(c) < /(c) logn, and we are done. Thus we may assume that no 

subgraph C € C contains a c-pumpkin-minor. 
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Let J be the graph obtained from G by contracting each subgraph C G C into a 
single vertex vc- Consider a subgraph C G C. We cannot have deg}(uc) = 0, because 
otherwise we could have applied Zl on any vertex vertex of C in G (since C has no 
c-pumpkin- minor) . If degj(fc) = 1, then let v be an arbitrary vertex of C, and let w 
be the unique vertex in V{G) \ V{C) having a neighbor in V{C) in the graph G. Since 
Zl cannot be applied on G with vertex v, there is a block B of G that includes v and 
containing a c-pumpkin-model. Since V{B) C V{G) U {w}, this model has size at most 
\B\ < 6 + 1 < /(c), that is, we have found a small c-pumpkin-model of G. Therefore, 
we may assume 

(9) deg}(^c) > 2 
for every G £ C. 

Let K be the graph obtained from J by contracting each subgraph P £ V into a 
single vertex vp. If two vertices of K are linked by at least c parallel edges, then we 
directly find a c-pumpkin-model in G of size at most b + r < f{c). Thus we may assume 

(10) n{K) < c. 

We have deg^(t;c') > 1 for every C £ C. Let us say that a subgraph C G C is bad if 
degj^(fc) = 1, and good otherwise. 

We color the vertices of each multipath P £ V as follows: a vertex v G ViP) is colored 
black if, in the graph G, all its neighbors outside P belong to bad subgraphs of C; the 
vertex v is colored white otherwise. (We remark that v could possibly have no neighbor 
outside P, in which case v is colored black by our definition.) 

Claim 4. // some multipath P £ V contains (4c)^'^ consecutive black vertices, then one 
can find a small c-pumpkin-model in G. 

Proof. Let Q be the subgraph of P induced by these (4c)^'^ black vertices. Let C be the 
subset of subgraphs G £ C such that vc is adjacent to an internal vertex of Q in the 
graph J. Let S := {vc ■ G £ C'}. Since all vertices of Q are colored black, it follows 
that internal vertices of Q are only adjacent in J to vertices in V{P) U S, and that every 
subgraph C G C is bad. 

Let H be the graph obtained from J[V{P) U S] by contracting every edge of P not 
included in Q. Since every subgraph C G C is bad, it follows from ^ that, in H, 
every vertex in S has at least two neighbors in Q. Hence {H,Q) is a hedgehog of size 

\Q\ = i^cr- 

The graph H is a minor of the subgraph G* of G induced by 

^(^) U |J{'*^(C) ■ C £ C']. 
Since vertices of P have degree at most k vn G and |C| < 6 for every G £ C , we have 

(11) \G*\ <r + r{k-l)b< f{c). 

It follows that every c-pumpkin-model of H can be extended to a small c-pumpkin-model 
of G. 

Applying Lemma 14.81 on {H,Q), we obtain either a bad cutset X of {H,Q) or a c- 
pumpkin-model of H. We have seen that we are done in the latter case, so assume the 
former holds and let {u,v} =: X. Consider a connected component T of H \ X that 
avoids both endpoints of Q and such that \T\ > 2. Let Z be the subgraph of G induced 
by {V{T) n V{Q)) U U{^('^) : vc £ V(T)}. It follows from the definition of H and our 
choice of T that Z is a connected component of G \ X with \Z\ > 2 and such that u 
and V are both adjacent to some vertex in Z. Thus {Z, u, v) is an outgrowth of G. This 
implies that T := T{Z,u,v) has a c-pumpkin-model, because otherwise we could have 
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applied Z2 on G with the outgrowth {Z,u,v). Since F is a minor of G* , by (jlip this 
model can be extended to a small c-pumpkin-model of G. □ 

By Claim m we may assume that, for every P £ V, the number ■w{P) of white vertices 
in P satisfies 

(12) nj{P) >j^,=k. 

Our aim now is to use (|12p to define a minor of K with large minimum degree. First, 
for every good subgraph C G C, "assign" vc to an arbitrary neighbor of vc in K. Next, 
for every w G W , contract all edges vqw of K into the vertex w for all vertices vc 
assigned to w. Similarly, for every P £ V, contract all edges vcvp into the vertex vp 
for all vertices vc assigned to vp. Finally, remove the vertex vc for every bad subgraph 
G £ C. The resulting graph is denoted L. 

For every vertex of L there is a natural induced subgraph of G that corresponds to 
it, namely the subgraph defined by all the edges that were contracted into w. Let S^, 
and Sp be the (induced) subgraph of G that corresponds to the vertex w £ W and vp 
{P £ V) of L, respectively. The subgraphs {w £ W) and Sp {P £ V) of G have 
diameter at most 2r and 3r, respectively. Thus, by Lemma [2.H a c-pumpkin-model of L 
of size q can be turned into one of G of size at most 3rc • q. Hence, in order to conclude 
the proof, it is enough to find a c-pumpkin-model in L of size at most h{c) log \ L\, since 

Mc)log|L| < Miog|L| < 4^1ogn. 

Src ore 

To do so, we will show that L has large minimum degree. 

First consider a vertex w £ W. Let a be the number of edges incident with w in K 

such that the other endpoint is a vertex of the form vc (C £ C) that was assigned to 

w. By do]), w cannot be adjacent in if to a vertex vq corresponding to a bad subgraph 

C £ C. Thus, it follows from the definitions of good subgraphs and L that 

deg^iw) > + {degKiw) - a). 

(The ^jp^ term above comes from the fact that each vertex vc that was assigned to w 
contributes at least one to the degree of w in L, while in K there were at most f^{K) 
edges between vc and w.) Using (fTUj) we obtain 

13 degL{w)> - + {degK{w)-a)>^^^>-. 

c c c 

Now consider a multipath P £ V. Let a' be the number of edges incident with vp in 

K such that the other endpoint is a vertex of the form vc (C £ C) that was assigned to 

Vp. Let h' be the number of edges incident with vp in K that are not of the previous 

form and also not incident with a vertex vc such that C is bad. By the definition of 

white vertices, we have a' + h' > w{P) (recall that w{P) is the number of white vertices 

in P). Using ([12]), it follows 

/ .N , / N o,' ,/ a' ,/ w(P) k 

(14) deg^vp) > — + b'>- + b'> > -. 

/i(ivj c c c 

It follows from ()13p and ()14p that L has minimum degree at least k/c. If fJ,{L) > c, 
then L has a c-pumpkin-model of size two and we are trivially done, so let us assume 
//(L) < c. Then the underlying simple graph L' of L has minimum degree at least 
k/c^ > 22v^+^ Using Corollary SlU] on L' , we find a c-pumpkin-model in L of the 
desired size, that is, of size at most h{c) log \L\. 

Finally, we note that each step of the proof can easily be realized in polynomial time. 
Therefore, a small c-pumpkin-model of G can be found in polynomial time. □ 
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4.4. Algorithmic consequences. Lemma 14.111 can be used to obtain O(logn)- 
approximation algorithms for both the Minimum c-Pumpkin-Covering and the Max- 
imum c-Pumpkin-Packing problems for every fixed c > 1, as we now show. 



Algorithm 1 A ©(log n)-approximation algorithm. 
INPUT: An arbitrary graph G 

OUTPUT: Ac-packing of G and a c-hitting set A of Gs.t. \X\ < (/(c) log |G|)-|A1| 
7W ^ 0; A ^ 

If |G| < 1: Return X /* G cannot have a c-pumpkin-minor */ 
Else: 

If G is not c-reduced: 

Apply a reduction rule on G, giving a graph H 

Call the algorithm on ff, giving a packing ^A' and a c-hitting set X' of H 
Compute using Lemma ITTT b) a c-packing M. oi G with \M.\ = |A^'| 
Compute using Lemma l4.1l fa) a c-hitting set A of G with |A| < |A'| 
Return M, X 

Else: 

Compute using Lemma 14.111 a c-pumpkin-model M = {A^B} of G with 

I^USI </(c)log|G| 
H ^G\{AUB) 

Call the algorithm on giving a packing Ai' and a c-hitting set X' of H 
M^M'yj {M} 
X ^ X'yjAVJB 
Return M , X 



Theorem 4.12. Given an n-vertex graph G, a O {log n)- approximation for both the 
Minimum c-Pumpkin-Covering and the Maximum c-Pumpkin-Packing problems 
on G can be computed in polynomial time using Algorithm^ for any fixed integer c > 1. 

Proof. Consider Algorithm [H where /(c) is the function in Lemma 14.111 We will show 
that this algorithm provides a ©(log n)-approximation for the two problems under con- 
sideration. 

It should be clear that the collection Ai returned by Algorithm [1] is a c-packing of G, 
and similarly that the set A is a c-hitting set of G. Thus it is enough to show that they 
satisfy |A| < (/(c) log |G|) • \M\ as claimed in the description of the algorithm. Indeed, 
since |A1| < i^dG) < Tc{G) < \X\ and /(c) is a constant depending only on c, this 
implies that the approximation factor of Algorithm [1] is ©(logn) for both the Minimum 
c-Pumpkin-Covering and the Maximum c-Pumpkin-Packing problems. 

We prove the inequality |A| < (/(c) log |G|) • by induction on |G|. The inequality 
is obviously true in the base case, namely when |G| < 1, so let us assume |G| > 1. 

If G is not c-reduced, then by induction the packing Ai' and the c-hitting set X' of 
H considered by the algorithm satisfy \X'\ < (/(c) log \H\) ■ \Ai'\, and we obtain 

|A| < \X'\ < {fic)log\H\) ■ \M'\ = if{c)log\H\) ■ \M\ < (/(c)log|G|) • \M\ 



as desired. 
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If G is c-reduced, then by induction the packing M' and the c-hitting set X' of H 
satisfy \X'\ < (/(c) log Iff]) • \M'\, and we have 

\X\ = \X'\ + \AUB\ 

< (f{c)log\H\) .\M'\+ f{c)log\G\ 

<(f{c)log\G\)-{\M'\ + l) 

= {f{c)log\G\)-\M\. 

Thus \X\ < (/(c)log|G|) • \M\ holds in ah cases. 

Finally, we observe that there are at most n recursive calls during the whole execution 
of the algorithm, which implies that its running time is polynomial in n. □ 

We would like to conclude by observing that Lemma [4 . 1 1 1 also implies the existence of 
an algorithm for p-c-CoVER with running time 2''^^^^"^^°^^^ ■n''^^^\ While it is faster than 
the 2'-'^^^°^^^ ■ n'-^(^) algorithm that follows directly from the kernel of size 
given by [18], it is of course not as good as the one presented in Section [3l Nevertheless, 
we find it interesting that a running time of 2'-^('^'°s^°s*^) • n'^^^^ can already be achieved 
without using techniques such as iterative compression. 

Theorem 4.13. There exists an algorithm to solve the p-c-CovER problem in time 
20{k\oglogk) . ^0(1) _ 

Proof. We only provide a rough sketch of the proof; the interested reader should have 
no problem filling in the missing details. 

First we compute the mentioned kernel K of size 0{k^log^/^k). Then we run the 
following recursive algorithm on K: 

• Apply iteratively reduction rules Zl and Z2 on K until K is c-reduced. 

• Compute using Lemma 14.111 a c-pumpkin-model M of of size 
0(log(A:2log3/2A:)) = 0{logk). 

• Branch on every vertex of M and repeat this procedure recursively. 

Note that, in the last step of the algorithm, every solution must contain at least one 
vertex of M. Hence this yields a parameterized algorithm with running time ©(log'' k) ■ 

5. Concluding remarks 
On the one hand, we provided an FPT algorithm running in time 

20(fc).^0(i) deciding, 

for any fixed c > 1, whether all c-pumpkin-models of a given graph can be covered by at 
most k vertices. In our algorithms we used protrusions, but it may be possible to avoid 
it by further exploiting the structure of the graphs during the iterative compression 
routine (for example, a graph excluding the 3-pumpkin is a forest of cacti). We did not 
focus on optimizing the constants involved in our algorithms; it may be worth doing it, 
as well as enumerating all solutions, in the same spirit of [20] for FEEDBACK Vertex 
Set. 

It is natural to ask whether there exist faster algorithms for sparse graphs. Also, 
it would be interesting to have lower bounds for the running time of parameterized 
algorithms for this problem, in the spirit of those recently provided in [24]. One could 
as well consider other containment relations, like topological minor, induced minor, or 
contraction minor. 

A more difficult problem seems to find single-exponential algorithms for the problem 
of deleting at most k vertices from a given graph so that the resulting graph has tree- 
width bounded by some constant. One could also consider the parameterized version of 
packing disjoint c-pumpkin-models, as it has been done for c = 2 in [5]. 
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On the other hand, we provided a ©(log n)-approximation for the problems of packing 
the maximum number of vertex-disjoint c-pumpkin- models, and covering all c-pumpkin- 
models with the smallest number of vertices. It may be possible that the covering version 
admits a constant-factor approximation; so far, such an algorithm is only known for 
c < 3. 

As mentioned in the introduction, for the packing version there is a lower bound 
of il(log^/^~^ n) on the approximation ratio (under reasonable complexity-theoretic as- 
sumptions). In fact, this lower bound applies to both the vertex-disjoint packing and 
the edge-disjoint packing [19]. For c = 2, the problem of packing a maximum number of 
edge-disjoint cycles admits a 0{y/log n)-approximation, whereas up to date 0{logn) is 
the best approximation ratio known for vertex-disjoint cycles [22]. Therefore, one might 
expect to get better approximation algorithms for packing edge-disjoint c-pumpkin mod- 
els. 

Finally, a class of graphs Ti has the Erdos-Posa property if there exists a function / 
such that, for every integer k and every graph G, either G contains k vertex-disjoint 
subgraphs each isomorphic to a graph in "H, or there is a set S C V{G) of at most f{k) 
vertices such that G\S has no subgraph in Ti. Given a connected graph H, let M{H) 
be the class of graphs that can be contracted to H. Robertson and Seymour [27] proved 
that M (H) satisfies the Erdos-Posa property if and only if H is planar. Therefore, for 
every c > 1, the class of graphs that can be contracted to the c-pumpkin satisfies the 
Erdos-Posa property. But the best known function / is super-exponential (see [12]), so 
it would be interesting to find a better function for this case. The only known lower 
bound on / is Q{klogk) when c > 2, which follows from the il.{klogk) lower bound 
given by Erdos and Posa in their seminal paper [14] for c = 2. 

References 

[1] V. Bafna, P. Berman, , and T. Fujito. A 2-approximation algorithm for the undirected feedback 
vertex set problem. SI AM Journal on Discrete Mathematics, 12(3):289-297, 1999. 

[2] R. Balasubramanian, M. Fellows, and V. Raman. An improved fixed parameter algorithm for Vertex 
Cover. Information Processing Letters, 65:163-168, 1998. 

[3] A. Becker and D. Geiger. Optimization of pearl's method of conditioning and greedy-like approxi- 
mation algorithms for the vertex feedback set problem. Artificial Intelligence, 83:167-188, 1996. 

[4] H. L. Bodlaender, F. V. Fomin, D. Lokshtanov, E. Penninkx, S. Saurabh, and D. M. Thilikos. 
(Meta) kernelization. In Proc. of the 50th Annual IEEE Symposium on Foundations of Computer 
Science (FOCS), pages 629-638, 2009. 

[5] H. L. Bodlaender, S. Thomasse, and A. Yeo. Kernel bounds for disjoint cycles and disjoint paths. In 
Proc. of the 17th Annual European Symposium on Algorithms (ESA), volume 5757 of LNC'S, pages 
635-646, 2009. 

[6] H. L. Bodlaender, J. van Leeuwen, R. B. Tan, and D. M. Thilikos. On interval routing schemes and 

treewidth. Information and Computation, 139(1):92-109, 1997. 
[7] Y. Cao, J. Chen, and Y. L. 0002. On feedback vertex set new measure and new structures. In Proc. 

of the 12th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT), volume 6139 

of LNCS, pages 93-104, 2010. 
[8] J. Chen, B. Chor, M. Fellows, X. Huang, D. W. Juedes, I. A. Kanj, and G. Xia. Tight lower bounds 

for certain parameterized NP-hard problems. Information and Computation, 201(2):216-231, 2005. 
[9] J. Chen, I. A. Kanj, and G. Xia. Improved upper bounds for vertex cover. Theoretical Computer 

Science, 411(40-42):3736-3756, 2010. 
[10] J. Chen, Y. Liu, S. Lu, B. O'Sullivan, and I. Razgon. A fixed-parameter algorithm for the directed 

feedback vertex set problem. In Proc. of the 40th Annual ACM Symposium on Theory of Computing 

(STOC), pages 177-186, 2008. 

[11] F. K. H. A. Dehne, M. R. Fellows, M. A. Langston, F. A. Rosamond, and K. Stevens. An 0(2*^('^)n'^) 
FPT Algorithm for the Undirected Feedback Vertex Set Problem. Theory of Computing Systems, 
41(3):479-492, 2007. 

[12] R. Diestel. Graph Theory, volume 173. Springer- Verlag, 2005. 

[13] R. G. Downey and M. R. Fellows. Parameterized complexity. Springer- Verlag, 1999. 



HITTING AND HARVESTING PUMPKINS 



27 



[14] P. Erdos and L. Posa. On independent circuits contained in a graph. Canadian Journal of Mathe- 
matics, 17:347-352, 1965. 

[15] S. Fiorini, G. Joret, and U. Pietropaoli. Hitting Diamonds and Growing Cacti. In Proc. of the 
14th International Conference on Integer Programming and Combinatorial Optimization (IPCO), 
volume 6080 of LNCS, pages 191-204, 2010. 

[16] S. Fiorini, G. Joret, D. O. Theis, and D. R. Wood. Small Minors in Dense Graphs. Manuscript, 
available at http://arxiv.org/abs/1005.0895, 2010. 

[17] F. V. Fomin, S. Gaspers, D. Kratsch, M. Liedloff, and S. Saurabh. Iterative compression and exact 
algorithms. Theoretical Computer Science, 411(7-9):1045-1053, 2010. 

[18] F. V. Fomin, D. Lokshtanov, N. Misra, G. Philip, and S. Saurabh. Hitting forbidden minors: Ap- 
proximation and Kernelization. In Proc. of the 28th Symposium on Theoretical Aspects of Computer 
Science (STACS), volume 9 of LIPIcs, pages 189-200, 2011. 

[19] Z. Friggstad and M. Salavatipour. Approximability of packing disjoint cycles. Algorithmica, 60:395- 
400, 2011. 

[20] J. Guo, J. Gramm, F. Hiiffner, R. Niedermeier, and S. Wernicke. Compression-based fixed- 
parameter algorithms for feedback vertex set and edge bipartization. Journal of Computer and 
System Sciences, 72(8): 1386-1396, 2006. 

[21] R. Impagliazzo, R. Paturi, and F. Zane. Which problems have strongly exponential complexity? 
Journal of Computer and System Sciences, 63(4):512-530, 2001. 

[22] M. Krivelevich, Z. Nutov, M. R. Salavatipour, J. Yuster, and R. Yuster. Approximation algorithms 
and hardness results for cycle packing problems. ACM Transactions on Algorithms, 3(4), 2007. 

[23] M. Lampis. Algorithmic meta-theorems for restrictions of treewidth. In Proc. of the 18th Annual 
European Symposium on Algorithms (ESA), volume 6346 of LNCS, pages 549-560, 2010. 

[24] D. Lokshtanov, D. Marx, and S. Saurabh. Known Algorithms on Graphs of Bounded Treewidth 
are Probably Optimal. In Proc. of the 22nd annual ACM-SIAM Symposium on Discrete algorithms 
(SODA), pages 777-789, 2011. 

[25] B. A. Reed, K. Smith, and A. Vetta. Finding odd cycle transversals. Operations Research Letters, 
32(4):299-301, 2004. 

[26] N. Robertson and P. Seymour. Graph Minors. XIII. The Disjoint Paths Problem. Journal of Com- 
binatorial Theory, Series B, 63(1):65-110, 1995. 

[27] N. Robertson and P. D. Seymour. Graph Minors. V. Excluding a Planar Graph. Journal of Com- 
binatorial Theory, Series B, 41(1):92-114, 1986. 

Departement dTnformatique 
Universite Libre de Bruxelles 
Brussels, Belgium 

E-mail address: gjoret@ulb.ac.be 

AlGCo pro,iect-team 
CNRS, LIRMM 

MONTPELLIER, FRANCE 

E-mail address: {paul,sau,thomasse}@lirmm.fr 

The Institute of Mathematical Sciences 
Chennai, India 

E-mail address: saketOimsc. res . in 



